Speech separation based on the GMM PDF estimation

Authors

Xiao Yu

Guangrui Hu

Abstract

In this paper, speech separation is treated as a convolutive-mixture Blind Source Separation (BSS) problem. The main approaches to the BSS problem are the Maximum Entropy (ME), Minimum Mutual Information (MMI) and Maximum Likelihood (ML) algorithms, and the relationship among these three algorithms is analyzed. Based on the feedback network architecture, a new speech separation algorithm is proposed that uses Gaussian Mixture Model (GMM) pdf estimation. Computer simulation results indicate that the proposed algorithm achieves a faster convergence rate and a lower output Mean Square Error than the conventional ME algorithm.
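The abstract does not spell out how the GMM pdf estimate enters the separation update, so the following is only a minimal sketch of one common reading: fit a one-dimensional GMM to samples of an output signal with plain EM, then evaluate the score function phi(y) = -d/dy log p(y), the kind of nonlinearity that ML- and ME-style BSS updates rely on. The function names (`fit_gmm_1d`, `gmm_score`), the quantile initialisation and the number of components are illustrative assumptions, not the authors' algorithm.

```python
import math
import random


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def fit_gmm_1d(samples, n_components=2, n_iter=50, var_floor=1e-6):
    """Fit a 1-D GMM with plain EM (hypothetical helper, not from the paper).

    Returns (weights, means, variances) as Python lists.
    """
    n = len(samples)
    ordered = sorted(samples)
    # Assumed initialisation: means at evenly spaced quantiles, shared variance.
    means = [ordered[int((k + 0.5) * n / n_components)] for k in range(n_components)]
    mu = sum(samples) / n
    var0 = max(sum((x - mu) ** 2 for x in samples) / n, var_floor)
    variances = [var0] * n_components
    weights = [1.0 / n_components] * n_components

    for _ in range(n_iter):
        # E-step: posterior probability of each component for each sample.
        resp = []
        for x in samples:
            joint = [w * gaussian_pdf(x, m, v)
                     for w, m, v in zip(weights, means, variances)]
            total = sum(joint) or 1e-300  # guard against underflow
            resp.append([j / total for j in joint])
        # M-step: re-estimate weights, means and variances.
        for k in range(n_components):
            nk = sum(r[k] for r in resp)
            if nk < 1e-12:
                continue  # leave an empty component unchanged
            weights[k] = nk / n
            means[k] = sum(r[k] * x for r, x in zip(resp, samples)) / nk
            var_k = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, samples)) / nk
            variances[k] = max(var_k, var_floor)
    return weights, means, variances


def gmm_score(x, weights, means, variances):
    """Score function phi(x) = -d/dx log p(x) of the GMM density p."""
    joint = [w * gaussian_pdf(x, m, v)
             for w, m, v in zip(weights, means, variances)]
    total = sum(joint)
    if total == 0.0:
        return 0.0  # density underflowed; no usable gradient here
    return sum(j * (x - m) / v for j, m, v in zip(joint, means, variances)) / total


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic bimodal data standing in for one separated output channel.
    data = [rng.gauss(-2.0, 0.5) if rng.random() < 0.5 else rng.gauss(2.0, 0.5)
            for _ in range(2000)]
    w, m, v = fit_gmm_1d(data)
    print("weights:", w, "means:", m, "variances:", v)
    print("score at 0.5:", gmm_score(0.5, w, m, v))
```

In a separation loop, a score function of this kind would typically be re-estimated from the current outputs and plugged into the weight update in place of a fixed nonlinearity; how often to refit and how many components to use are choices the abstract leaves open.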

Similar resources

Voice activity detection using global soft decision with mixture of Gaussian model

An improvement to the voice detection algorithm using global soft decision (GSD) is made in this paper. In the GSD method, the speech and noise are modelled by a presumed probability density function, e.g. a Gaussian pdf. We propose that the estimation and modelling of the signal be done in the domain of the filterbank output, which is widely used in most speech processing applications. Since the output of...

Combination of temporal domain SVD based speech enhancement and GMM based speech estimation for ASR in noise - evaluation on the AURORA2 task -

In this paper, we propose a noise-robust speech recognition method that combines temporal domain singular value decomposition (SVD) based speech enhancement with Gaussian mixture model (GMM) based speech estimation. The bottleneck of the GMM based approach is the noise estimation problem. To address it, we incorporated adaptive noise estimation into the GMM based approach. Furtherm...

Perceptual postfilter estimation for low bit rate speech coders using Gaussian mixture models

A novel perceptual postfilter is introduced. For each frame, the filter gains, z, are estimated given a vector, y, of the quantized LSFs and the long-term prediction gain of the corresponding frame. The proposed perceptual postfilter is derived from an optimal MMSE estimator, i.e. the estimated gain vector is ẑ = E{z|y}. The MMSE estimator is based on the conditional pdf of z given y, which is ...

Application of the blind source separation algorithm to separating speech and music signals

In this paper, the application of the Independent Component Analysis technique to speech-music separation is discussed. The separation algorithm operates in the time domain. It requires an estimate of the score function in order to minimize the mutual information. For estimating score function, sufficient samples of the mixed (speech-music) signals...

Estimation of Sound Source Direction Using Parabolic Reflection Board

This paper presents a new sound-source-direction estimation method using only a single microphone with a parabolic reflection board. In our previous work [1], we proposed GMM (Gaussian Mixture Model) separation for estimation of the sound source direction, where the observed (reverberant) speech is separated into the acoustic transfer function and the clean speech GMM. However, the previous met...

Publication date: 1998
